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DESCRIPTION 
Human H37 protein and cDNA encoding the Protein 



Technical Field 

This application relates to human H37 protein and to cDNA encoding the 
protein. More particularly, this application relates to human H37 protein that is 
an activity-controlling subunit for the protein Cdc7 controlling the replication of 
human cells; to human gene encoding the protein; to an antibody to the H37 
protein; and to a method for controlling the proliferation of human cells using 
such genetic engineering material and antibody. 

Background Art 

Proliferation of cells is initiated when a liquid factor called a growth factor 
is bound to the receptor on the ceft surface and a signal for proliferation is 
transmitted into the cell. Accordingly, for an artificial induction of proliferation of 
incubated cells, methods where an excessive amount of a growth factor is added 
to a cell medium, where a receptor which is not inherently owned by the said cell 
is expressed on the cell surface and a factor which is specific for the receptor is 
added to a medium, etc. have been carried out. Further, in suppressing the cell 
proliferation, methods where competing molecule, antagonist or the like to the 
receptor protein is added to the medium to suppress the binding of the receptor to 
the growth factor, etc. have been carried out. 

On the other hand, in the case of the cell where a proliferation signal is 
issued by binding of the receptor to the growth factor, a cycle in which its genomic 
DNA is replicated, uniformly distributed to daughter cells and then divided is 
repeated. Such a cycle is called "cell cycle" especially for eukaryotes. The cell 
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cycle is basically classified into four phases. Thus, S phase when chromosomal 
DNA is replicated; M phase when the replicated chromosome is divided by a 
spindle body and then cytoplasm is divided; Gl phase which is a period from M 
phase finishes until S phase begins; and G2 phase which is a period from S phase 
finishes until M phase starts. Transition from Gl phase to S phase is particularly 
strictly controlled and DNA replication takes place only once in S phase. 

It has been confirmed from the studies in yeast and higher eukaryotic 
cells that cyclin-dependent kinase plays a critical role in the cell cycle progression 
(Nature 292:558-560, 1981; Cell 66:731-742, 1991; Nature 349:338-393, 1991; 
Science 257:1958-1961, 1992; Bioassays 17:471, 1995). Further, from a genetic 
analysis in yeast, it has been clarified that another serme/threonine kinase plays 
an essential role in the initial stage of S phase (Gl/S transition). Thus, 
characterization of cdc7 mutation which was isolated as one of the cell division 
cycle mutants (J. Mol. Biol. 59:183-194, 1971), it revealed that the Cdc7 protein 
kinase functions immediately prior to chromosomal replication and that, during S 
phase, that is necessary for activation of origins (Mol. Cell Biol. 6:1590-1598, 
1986; Genes Dev. 15:480-490, 1998; Genes Dev. 15:491-501, 1998). It has been 
also clarified that the Cdc7 kinase activity is dependent upon the presence of a 
regulatory subunit, Dbf4 (Genetics 131:21-29, 1992; Mol. Cell. Biol. 13:2899-2908, 
1993). Expression of Dbf4 is periodic and is regulated by both at the 
transcriptional and post-translational levels (Exp. Cell Res. 180:419-428, 1989). 
The increase in Cdc7 kinase activity at the Gl/S boundary is at least accounted 
for by the elevated expression of Dbf4 in late Gl phase (Mol. Cell. Biol. 
13:2899-2908, 1994; Exp. Cell Res. 180:419-428, 1989). In addition, since Dbf4 
interacts with replication origins in vivo (Science 265:1243-1246, 1994), it has 
been suggested that the Cdc7 may trigger S phase by directly activating the 
replication initiation complex assembled at the origins. 

In addition, the inventors of this application had already isolated kinases 
related to yeast Cdc7 from Schizosacchatromyces prombe, Xenopus, mouse and 
human (J. Biol. Chem. 273:23248-23257, 1998; EMBO J. 16:4340-4351, 1997; 
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EMBO J. 14:3094-3104, 1995), and pointed out that eukaiyotic chromosomal 
repKcation is regulated by a conserved mechanism involving this family of kinase. 

From the findings in yeast and higher eukaryotes as mentioned above, it 
5 is expected that an artificial control of the cell proliferation is possible by means of 
the regulation of the Cdc7 kinase activity in cells, which is entirely other means 
than conventional methods using operation of growth factor/ receptor binding. 

However, the inventors had also found that the putative human homolog 
10 of Cdc7, huCdc7, possesses only a very low level of kinase activity when 
overexpressed in mammalian cells while a baculovirus expressed form of huCdc7 
is inactive. 

Under such circumstances, the inventors of this application have 
15 investigated a human cDNA library with a presumption of the presence of a 
regulatory subunit for human Cdc7, succeeded in isolating the cDNA encoding a 
novel protein which regulates the kinase activity by binding to huCdc7 and named 
the said protein encoded in this cDNA as H37 protein. 

20 An object of the invention is to provide the novel protein obtained by the 

inventors in an industrially applicable form. 

Another object of the invention is to provide a human gene encoding the 
protein and to provide a material for genetic operation including cDNA derived 
25 from the gene, an antibody to the protein. 

Still another object of the Invention is to provide a method for the artificial 
control of proliferation of human cells using the above material for genetic 
operation. 

30 
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Disclosure of the Invention 

This application provides human H37 protein having an amino acid 
sequence of SEQ ID NO: 1 or NO: 2. 

5 

This application also provides human H37 protein having an amino acid 
sequence where one or more amino acid residue(s) in the amino acid sequence of 
SEQ ID NO: 1 or NO: 2 is/ are deleted therefrom, substituted therefor or added 
thereto. 

10 

This application further provides a human gene encoding the 
above-mentioned human H37 protein; cDNA of said human gene which has a 
base sequence of SEQ ID NO: 3 or NO: 4; and DNA fragment comprising a partial 
sequence of those cDNAs. 

15 

This application furthermore provides a recombinant vector having the 
above-mentioned cDNA and an antibody against the human H37 protein. 

This application still further provides a method for promoting the 
20 proliferation of cells, which comprises introducing the above-mentioned cDNA or 
the DNA fragment together with expression regulatory sequence into cell, and a 
method for suppressing the proliferation of cells, which comprises introducing the 
above-mentioned antibody into the cell. 

25 

Brief Description of Drawings 

Fig. 1 is the result of a western blotting measuring the 
coimmunoprecipitation of H37 with huCdc7 expressed in mammalian cells. 
30 Lanes 1-4, immunoprecipitates; lanes 5-7, whole cell extract. Upper and middle 
pamels, immunoprecipitated with anti-huCdc7 antibody No. 1; lower panel, 
immunoprecipitated with anti- myc antibody. Extracts were prepared from Cos7 
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cells transfected with huCdc7 in combination with HI (lanes 2 and 5), HI 8 (lanes 
3 and 6), H37 (lanes 4 and 7) or huCdc7 alone (lane 1). Western blotting was 
conducted with anti-myc antibody (upper panel) or an anti-huCdc7 antibody No.l 
(middle and lower panel). 

5 

Fig. 2 is a result of the western blotting measuring antibodies against H37 
protein and association of huCdc7 and H37 in vivo. Nuclear extracts prepared 
from Cos7 ceUs transfected with myc-tagged H37 cDNA were blotted with 
anti-H37C (lane 1), anti-H37N (lane 2), anti-H37Cpep (lane 3) or anti-myc (lane 4) 

10 antibody. The arrow indicates the myc-tagged H37 protein, which carries 63 
amin o acids derived from 5' non-coding region in addition to the myc-tag. 
Immunoprecipitations from CEM extracts using either anti-huCdc7Cpep (lanes 5 
and 6) or anti-H37Cpep (lanes 7 and 8) were separated on gel electrophoresis and 
blotted with huCdc7 monoclonal antibody (4A8). The symbols - and + indicates 

15 absence and presence, respectively, of each antigen peptide during 
irnmunoprecipitation. Lanes 9-13; immunoprecipitates, prepared from nuclear 
extracts of HeLa cells by anti-huCdc7 No.l (lane 9), anti-huCdc7 monoclonal 
antibody 4A8 (lane 10), anti-H37C (lane 11), anti-H37N (lane 12) or anti-H37Cpep 
(lane 13), were blotted with anti-H37Cpep. 

20 

Fig. 3 shows the result of irnmunoprecipitation by the use of anti-huCdc7 
antibody No.l (lanes 1-5) or anti-myc antibody (lanes 6-10) from the extract of 
Cos7 cells transfected with myc-tagged H37 alone (lanes 1 and 6), or together with 
wild-type huCdc7 (lanes 2 and 7), or kinase negative huCdc7 (lanes 3 and 8)." 
25 Wild-type huCdc7 alone (lanes 4 and 9) and kinase negative huCdc7 alone (lanes 
5 and 10) were included as control. 

Fig. 4 shows mobility shift of H37 induced by coexpression of wild-type 
huCdc7. Extracts were prepared from Cos7 cells expressing either wild-type or 
30 kinase negative huCdc7 together with myc-tagged H37. Immmunoprecipitates 
with anti-huCdc7 antibody No. 1 (lanes 1 and 2) or anti-myc antibody (lanes 3 and 
4) were blotted with anti-myc antibody (upper) or anti-huCdc7 antibody (lower). 
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In all cases, samples were run on 8% SDS-PAGE. 

Fig. 5 shows an amino acid sequence of the full-length H37 protein which 
is as same as the SEQ ID NO: 1. 

5 

Fig. 6 shows schematic representation of conserved regions between Dbf4 
and H37. The double-arrowed region on Dbf4 was reported to be sufficient for 
interaction with huCdc7. Solid and gray double-arrowed region on H37 indicates 
the portion essential for interaction with huCdc7 or that sufficient for activation of 
10 huCdc7 kinase activity, respectively. 

Fig. 7 is a schematic representation of N-terminal and C-terminal deletion 
derivatives of H37 proteins. The number at the end of each bar indicates the 
portion of the amino acid (corresponding to SEQ ID NO: 1) at the deletion endpoint. 
15 The striped region indicates Dbf4 motif-C. 

Fig. 8 shows lacZ activity of H37 deletion derivatives in two-hybrid assay 
with huCdc7. 

20 Fig. 9 shows the result of antibody coprecipitation method in which H37 

deletion derivatives and huCdc7 were co-expressed in COS7 cells and a complex 
formation was measured. 

Fig. 10 (a) is the result of northern analysis of H37 rnRNA expression in 
25 various tissues, and (b) is the result of a northern analysis of H37 mRNA 
expression in various cancer cell lines. 

Fig. 11 shows the results where WI38 cells in a resting phase are 
stimulated by addition of 10% of serum and DNA contents at various stages were 
30 analyzed by means of FACS. 

Fig. 12 shows the result of northern analysis of H37 and huCdc6 
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expression by using RNA extracted from cells of Fig. 1 1 (upper), and the graphs 
showing the relative expression amount of each mRNA (middle and lower). 

Fig. 13 shows graphs showing each cell cycle fraction of human CEM cells 
5 fractionated by an elutriation method. 

Fig. 14 shows the result of northern analysis for expression of H37 and 
cyclin E in each fraction of Fig. 13 (upper), and also shows the graphs showing the 
relative expression amounts of each mRNA (middle and lower). 

10 

Fig. 15 shows the result of FACS analysis for DNA content in which HeLa 
cells were stopped at latter G2 phase using nocodazole and then the cell cycle was 
synchronously shifted. 

15 Fig. 16 shows the result northern analysis for expression of H37 and 

cyclin E in each cell cycle of Fig. 14 (upper), and also shows the graphs showing 
the relative expression amounts of each mRNA (middle and lower). 

Fig. 17 shows H37 localization in cells measured by indirect 
20 immunofluorescent technique. The antibodies used are anti-H37C (A), anti-H37N 
(C) and a control antibody (E) while B, D and F were pictures stained with DAPI. 

Fig. 18 shows kinetics of induction of DNA replication after serum 
stimulation of KD cells as measured by BrdU incorporation. 

25 

Fig. 19 shows the results in which KD cells, synchronized by serum 
starvation, were microinjected with various antibodies at 12 hours after serum 
stimulation, and then the rate of the cells incorporating the BrdU after 16 hours 
was measured. The numbers show the rate of the cells which are conducting the 
30 DNA synthesis. 

Fig. 20 is microscopic pictures of examples of cells into which a mixture of 
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anti-H37Cpep antibody and antigen peptide was microinjected. The pictures 
show the incorporated BrdU (upper), injected antibody (middle) and cells (lower). 

5 Best Mode for Carrying Out the Invention 

The human H37 protein of this invention has the amino acid sequence of 
SEQ ID NO: 1 and is a protein molecule encoded in the sequence region from 
518th to 2541st of cDNA whose base sequence is shown in SEQ ID NO: 3. The 

10 H37 protein of this invention also has the amino acid sequence of SEQ ID NO: 2 
and is a protein encoded in the sequence region from 518th to 1222nd of cDNA 
whose base sequence is shown in SEQ ID NO: 4. SEQ ID NO: 3 and NO: 4 are 
cDNAs derived from mRNA transcribed from the same genomic gene but the cDNA 
of SEQ ID NO: 4 is in a different splicing form from SEQ ID NO: 3, and there is a 

15 deletion from 1 199th to 1259th of SEQ ID NO: 3. 

Those H37 proteins can be prepared by known methods such as isolation 
from human organs and cell lines, chemical synthesis of peptide based upon the 
amino acid sequence provided by this invention, and a recombinant DNA 

20 technique using cDNA fragments provided by this invention. For example, in 
case the H37 protein is prepared by means of a recombinant DNA technique, RNA 
is prepared from a vector having the cDNA fragment of this invention by an in vitro 
transcription and then an in vitro translation is carried out using the above as a 
template whereby an expression in vitro is possible. In addition, when the 

25 translational region is recombined to a suitable vector by a known method, it is 
possible to express the H37 protein encoded in cDNA in a large quantity in 
Escherichia coE, Bacillus subtilis, yeast, insect cells, animal cells, etc. 

When the human H37 protein of this invention is expressed in a 
30 microorganism such as E. coli, the translational region of cDNA of this invention is 
inserted into an expression vector having, for example, promoter, ribosome 
binding site, cDNA cloning site, terminator, origin replicable in microorganism, etc. 
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whereupon a recombinant expression vector is prepared, then host cell is 
transformed using the said expression vector and the resulting transformant is 
incubated whereupon H37 protein encoded in cDNA can be produced in a 
microorganism in a large quantity. Alternatively, it can be expressed as a fusion 
5 protein with other protein. The resulting fusion protein is cleaved by an 
appropriate protease whereupon only the protein portion encoded in cDNA can be 
obtained. 

When the human H37 protein of this invention is expressed in animal 
10 cells, the translational region of cDNA of this invention is recombined with 
expression vector for animal cells having, for example, promoter, poly(A) addition 
site, and splicing site, and then introduced into the animal cells whereupon the 
H37 protein of this invention can be expressed in animal cells. 

15 The human H37 protein obtained by the method as mentioned above can 

be used as an antigen for the preparation of antibody for suppressing the 
proliferation of cells through inhibition of the kinase activity of huCdc7. 

In addition, as will be confirmed by the Examples later, the human H37 
20 protein of this invention has almost no similarity in terms of structure to cyclins 
which have been elucidated up to now, but it may be regarded as a cyclin-like 
associating factor for huCdc7 kinase in such respects that its expression is 
regulated by a cell cycle and that, as a result of binding to an huCdc7 catalytic 
subunit, its kinase activity can be activated. Accordingly, since the H37 protein 
25 is believed to be a very important target factor in a signal transduction pathway for 
the cell proliferation induced by a growth factor, elucidation of how the expression 
of the H37 protein or activity thereof is regulated by the signal of cell cycle in Gl/S 
phases is expected to provide a big and novel finding for clarifying the molecular 
mechanism of cell cycle control of cell replication in animal cells. 

30 

The human H37 protein of the invention contains peptide fragments (five 
or more amino acid residues) containing any partial amino acid sequence of SEQ 
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ID NO: 1 or NO: 2. Such peptide fragments can be also used as an antigen for 
the preparation of antibody. 

The gene of the invention is a human gene encoding the above-mentioned 
5 human H37 protein and can be isolated from known genomic libraries using, for 
example, the cDNA of this invention or a partial sequence thereof as a probe. 

The cDNA of the invention is characterized in having a base sequence of 
SEQ ID NO: 3 or NO: 4, and can be cloned, for example, from cDNA libraries 

10 derived from human cells. cDNA is synthesized using poly(A) + RNA extracted 
from human cells as a template. The human cells may be either those excised 
from human body by a surgical operation or cell lines. The cDNA can be 
prepared by known methods such as Okayama-Berg method (Okayama, H and 
Berg, P., Mol. Cell. BioL, 2:161-170, 1982), Gubler-Hoffinann method (Gubler, U. 

15 and Hof&nan, J. Gene, 25:263-269, 1983) and Capping method (Kato, S. et al., 
Gene, 150:243-250, 1994). 

The human H37 protein of the invention is expressed in any tissue except 
brain and kidney and, therefore, when the human cDNA library prepared from 
20 human cells is screened using an oligonucleotide probe synthesized based on the 
base sequence of cDNA in SEQ ID NO: 3 or NO: 4, it is possible to easily prepare 
the sama clone of the invention. It is also possible that the desired cDNA is 
synthesized by a polymerase chain reaction (PCR) using such an oligonucleotide 
as a primer. 

25 

In general, polymorphism due to the difference among individuals is 
frequently noted in human gene. Accordingly, cDNA of SEQ ID NO: 3 or NO: 4 
where one or more nucleotide(s) is/ are added thereto and/ or deleted therefrom 
and/ or other nucleotide(s) is/ are substituted therefor is also covered by the 
30 invention. 

Similarly, the protein where one or more nucleotide(s) is/ are added 
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thereto and/or deleted therefrom and/or other nucleotdde(s) is/are substituted 
therefor as a result of those modifications is also covered by the invention so far as 
it has an activity of the protein having an amino acid sequence of SEQ ID NO: 1 or 
NO: 2. Further, variant protein where one or more nucleotide(s) is/ are added 
5 thereto and/ or deleted therefrom and/ or other nucleotide(s) is/ are substituted by 
an artificial means is covered by this invention as well. 

The DNA fragment of the invention covers a cDNA fragment (10 bp or 
more) conlaining any partial base sequence of SEQ ID NO: 3 or NO: 4, or a cDNA 
10 fragment comprising an antisense strand thereof. 

Antibody against human H37 protein of the invention can be prepared as 
a polyclonal antibody or a monoclonal antibody by a known method using protein 
per se or a partial peptide thereof. 

15 

A method for promoting the cell proliferation according to the invention is 
carried out by such a manner that a recombinant DNA comprising the cDNA 
having a base sequence of SEQ ID NO: 3 or NO: 4, or comprising a partial 
sequence thereof (such as a DNA fragment encoding 419 amino acid sequence 

20 region at the C terminal as shown in Example 3) and an expression-regulatory 
sequence (promoter and/or enhancer sequence for animal cells) is introduced into 
animal cells whereby the H37 protein having an amino sequence of SEQ ID NO: 1 
or NO: 2 is excessively expressed in cell nuclei. Introduction of the recombinant 
DNA into cells can be carried out by a known method such as a calcium 

25 phosphate method, a method where ribosome and erythrocyte ghost are used, an 
electroporation method, a method where retrovirus or adenovirus is used as a 
vector and a microinjection method using a glass pipette. Promotion of cell 
proliferation as such is useful, for example, in a large-scale preparation of the 
stem cell that is useful for the therapy of human diseases. Stem cells can be 

30 differentiated into other kinds of cells such as blood stem cell and nerve stem cell, 
and are able to produce a large number of cells of different cell type that constitute 
the human body. Therefore, transplantation of stem ceE in the diseases such as 
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leukemia is a very important therapeutic means. However, since no liquid factor 
for self-proliferation of human stem cells without differentiation has been 
identified yet, the preparation of stem cells in an amount sufficient for the therapy- 
has not been easy. According to the method of this invention, it could be possible 
5 that, as a result of manipulating the proliferation program in stem cells, the stem 
cells can be unlimitedly self-replicated and self-proliferated in vitro. In addition, 
such a promotion of cell proliferation in vitro could be also useful for the 
preparation of a large quantity of cells for introduction of gene for genetic therapy 
by ex vivo means. 

10 

The method for the suppression of cell proliferation according to the 
invention is carried out by injecting the above-mentioned antibody into cells. 
Alternatively, it may be carried out by inhibiting the expression of the ceU-mtrinsic 
H37 protein gene as well. An example thereof is a method where DNA encoding 
15 the ribo2yme sequence or the antisense sequence to the transcript of the gene is 
introduced into the cells. Suppression of the cell proliferation as such is expected 
to provide a novel means for suppressing an excessive proliferation of cancer cells 
for example. 

20 

Examples 

The invention will now be further illustrated by way of the following 
examples although the invention is not limited to those examples. 

25 

Example 1. 

H37 cDNA labeled with a Flag tag was ligated to the down stream of an 
30 SRalpha promoter and introduced into IL-3 (interleukin 3)-dependent proB cell, 
Ba/F3, to give a stable transformant cell line constitutively expressing the 
Flag-tagged H37 protein. In this cell line, rate of the cell at the late S phase or G2 
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phase where DNA synthesis was almost finishing or was finished in the population 
of the proliferating cells increased as compared with the control parental cell line. 
On the other hand, the parental cells stopped its proliferation by removal of IL-3 
and came into GO phase but, in the stable transformant cell line, rate of the cells 
5 existing in S phase significantly increased even after the removal of IL-3. This 
result shows that the constitutive production of the H37 protein promotes the 
shift from G0/G1 phase to S phase, or the progress of S phase itself. It also 
suggests that the progress of cell cycle can be operated by an artificial expression 
of the H37 protein or a derivative thereof. 

10 

Example 2. 

cDNA library of HeLa cell was prepared using a pGAD-GH vector and each 
15 vector was introduced into yeast strain CG1945 that has a recombinant plasmid 
with huCdc7 fused to DNA-binding domain of Gal4. As a result of a screening of 
about 3 x 10 s cells of transformed yeast, five clones where P-galactosidase was 
positive were obtained. DNA base sequence of the insert was determined and the 
database was searched whereupon all of them were novel cDNA. Among them, 
20 three clones were same having a base sequence of SEQ ID NO: 1. This cDNA was 
named H37. Other two were single clones and were named HI and H18, 
respectively. 

The interaction of huCdc7 with protein encoded in those positive clones 
25 was further investigated using a production increase system in animal cells. 
Thus, each expression plasmid of HI, H18 and H37 tagged with myc epitope was 
transfected to animal cell Cos7 together with a full-length huCdc7 expression 
plasmid. The result is as shown in Fig. 1. Although the H37 protein ,was 
coimmunoprecipitated with an antibody against huCdc7, HI and H18 proteins 
30 were not coprecipitated (cf. lanes 2-4 of the upper panel, Fig. 1). On the contrary, 
huCdc7 was coimmunoprecipitated by using anti-myc antibody alone in the case 
of the cells where myc-tagged H37 is co-expressed (cf. lane 4 of the lower lane, Fig. 
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1). From those results, it has been confirmed that only H37 cDNA encodes a 
protein efficiently interacting with huCdc7. 

Then, in order to investigate the intrinsic H37 protein, each antibody 
5 against N- or C-terminal regions of H37 (anti-H37N antibody or anti-H37C 
antibody) was prepared. Further, antibody against C-terminal oligopeptide of 
H37 (anti-H37Cpep antibody) and antibody against C-terminal oligopeptide of 
huCdc7 (anti-huCdc7Cpep antibody) were also prepared. After that, association 
of each antibody with intrinsic huCdc7 and H37 in cells was measured. The 

10 result is as shown in Fig. 2. Thus, all of the antibodies to H37 specifically react 
with the myc-tagged H37 protein of 90 kDa expressed in Cos7 cells (cf. lanes 1-4, 
Fig. 2). Complex prepared from human CEM cells was able to be coprecipitated 
using an affinity-purified anti-peptide antibody. It was confirmed by an 
immunoblotting using anti-huCdc7Cpep antibody that all of huCdc7 and H37 

15 immunoprecipitates contained huCdc7 (lanes 5 and 7, Fig. 2). This interaction of 
H37 with huCdc7 completely disappeared when antibody and peptide used as 
antigen for preparing the antibody were subjected to a pre-incubation in advance 
(cf. lane 8, Fig. 2). In extracts of HeLa cells, both anti-Cdc7 antibody and 
anti-H37 antibody were able to coprecipitate one polypeptide of 80 kDa that 

20 specifically reacted with anti-H37 antibody (lanes 9-13, Fig. 2). 

From the above result, it was found that intrinsic huCdc7 and H37 
protein were present in cells as a complex. 

25 

Example 3. 

In order to investigate whether H37 protein has an ability of activating 
huCdc7, huCdc7/H37 complex prepared by expression of myc-tagged H37 and 
30 wild-type or kinase negative huCdc7 in Cos cells was immunoprecipitated by 
anti-huCdc7 antibody or anti-myc antibody and then a kinase reaction in vitro 
was measured using GST-MCM3 fusion protein as a substrate. The result is as 
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shown in Fig. 3 and in Fig. 4. In the presence of wild-type huCdc7, 
phosphorylation in a good efficiency was observed for the MCM3 protein in both 
immunoprecipitate of anti-huCdc7 antibody and immunoprecipitate of anti-myc 
antibody (cf. lanes 2 and 7, Fig. 3). Further, two more phosphorylated proteins 
5 were observed and they were identified as transfected huCdc7 and myc-tagged 
H37 (data not shown). Since the phosphorylation as such was not detected at all 
in kinase negative huCdc7, it was confirmed that kinase activity of huCdc7 acted 
in the phosphorylation. However, kinase negative huCdc7 was also able to form a 
complex with H37 protein (cf. lanes 3 and 8, Fig. 3; lanes 2 and 4, Fig. 4). In 

10 addition, the mobility of H37 protein on gel electrophoresis was slow when a 
wild-type huCdc7 was co-expressed and detected as plural bands while such a 
shift was not observed in kinase negative huCdc7 (cf. lanes 1 and 3, Fig. 4). The 
band where the mobility was slow disappeared by a treatment with phosphatase 
and, therefore, it was confirmed to be a perphosphorylated H37 protein (data not 

15 shown). In addition, when huCdc7 and H37 protein were co-expressed in insect 
cells, it was possible to reconstitute a very strong kinase activity which was able to 
efficiently phosphorylate MCM2 and MCM3 proteins (data now shown). 

The above result shows that H37 protein activates huCdc7 kinase and 
20 further that H37 protein itself is phosphorylated by huCdc7 . 

Further, under those experimental conditions, mtrinsic H37 protein level 
was too low and in the case where only huCdc7 catalytic subunit was expressed 
and, therefore, the kinase activity was little (cf. lanes 4 and 9, Fig. 3). From those 
25 facts, it was corifirmed that H37 protein encoded regulatory cell unit of huCdc7 
and specifically activated its kinase activity. 

Example 4. 

30 

An amino acid sequence of the H37 protein (SEQ ID NO: 1) was analyzed. 
The result was that, as shown in Fig. 5 and Fig. 6, an amino acid sequence region 
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having a homology of 33% with budding yeast Dbf4 was found. This conserved 
domain (H37 motif C) was also present in the gene similar to H37 identified in 
mouse, fruit fly and fission yeast (cf. Fig. 6). Further, another amino acid 
sequence of H37 (H37 motif N) was conserved in the H37-related gene in mouse, 
5 fruit fly and fission yeast. However, the H37 motif N was conserved in a 
somewhat modified form on the Dbf4 protein of a budding yeast (cf. Fig. 6; data 
partially not shown). 

In order to determine the region on H37 protein essential for binding with 
10 huCdc7, a series of deletion derivatives of H37 at N- and C-terxninals as shown in 
Fig. 7 was prepared, each of them was expressed in yeast as a fusion protein with 
Gal4 activated domain and an interaction of each deletion derivative with huCdc7 
was investigated by means of a two-hybrid assay. The result is as shown in Fig. 8. 
As a result of deletion at N-terrninal, even a deletion of 255 amino acids at 
15 N-terminal did not affect the interaction with huCdc7 (AN2). However, when 50 
amino acids at N-terminal were further deleted whereupon the H37 motif C was 
deleted, an interaction with huCdc7 was completely lost (AN3). 

On the other hand, with regard to deletion from C-terminal, when only 20 
20 amino acids were deleted, the binding ability with huCdc7 decreased to an extent 
of about 60% (AC). In addition, when 243 or 369 amino acids were deleted from 
the C-terminal (AP2 and AB), the interaction lowered to an extent of about 10% of 
the full-length clone. API containing only 235 amino acids of N-terminal did not 
interact with huCdc7. However, 50 amino acids that were commonly present in 
25 AB and AN2 were not sufficient for an efficient interaction with huCdc7 (data not 
shown). 

A part of the above-mentioned H37 deletion derivatives was co-expressed 
in COS7 cells together with huCdc7 and it was confirmed by immunoprecipitation 
30 method whether it formed a complex with huCdc7 protein. The result was that, 
like in the result of a two-hybrid assay, it was confirmed that only H37 deletion 
derivatives of deltaB and deltaN2 formed a complex with huCdc7 (cf. Fig. 9). 
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The above result shows that H37 motif-C is essential for an interaction of 
H37 protein with huCdc7 catalytic subunit but shows that H37 motif C alone is 
insufficient. In the budding yeast, it was reported already that the region 
5 containing H37 motif C was sufficient for binding with Cdc7 (Mol. Cell. Biol. 
15:6775-6782, 1996). When an in vitro kinase reaction was carried out using 
those deletion derivatives, it was found that, although the extent was weak, only 
419 amino acids at the C-terminal containing Dbf4 motif-C was sufficient for 
activation of phosphorylating ability of huCdc7 {data not shown). 

10 

Example 5. 

Expression patterns of H37 mRNA in various human tissues and cancer 
15 cells were investigated by northern blotting. The result is as shown in Fig. 10 (a) 
and (b). By an H37 cDNA-specific probe, a transcribed product of 2.5 kb was 
detected in all tissues except brain and kidney and also in all cancer cells. That 
was contrary to the fact that, in brain and kidney, huCdc7 mRNA was relatively 
highly expressed (EMBO J. 16:4340-4351, 1997). Among the tissues tested, 
20 expression of H37 mRNA was highest in testicles and then in thymus, and it was 
reported by the inventors already that both of them were the tissues where the 
expression of huCdc7 catalytic subunit was particularly high as well (EMBO J. 
16:4340-4351, 1997). In the testicles, two different RMA bands of 6 kb and 4 kb 
were also detected {cf. Fig. 10 (a)) although it was ambiguous what the real 
25 substances therefor were. It was also confirmed that H37 mRNA was expressed 
in a very high level in nearly all cancer cells except lung cancer cell A549 (cf. Fig. 
10 (b)). That shows an important role of H37 protein in the cells having an active 
proliferating ability. 

30 

Example 6. 
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In order to investigate whether expression of H37 was regulated by cell 
cycle, human normal fibroblast cell WI38 was synchronized with GO phase by 
means of serum starvation (cf. Fig. 1 1), total RNA was prepared at various periods 
after addition of serum and the level of H37 mRNA was investigated by northern 
5 blotting. The result is as shown in Fig. 12. H37 mRNA level was low in the cell 
at the resting stage and, as the cells came near the boundary of Gl and S, it 
gradually increased and, after 20 hours from the serum addition, it became 
maximum. The expression pattern shown in Fig. 12 was similar to that of 
transcribed product of huCdc6 that is know to be induced by growth stimulation 
10 (Mol. Cell. Biol. 15:4215-4224, 1995; Proc. Natl. Acad. Sci. USA 94:142-147, 
1997). 

Further, in order to investigate the changes in expression of H37 in cell 
cycle, human CEM cells were fractionated by means of elutriation method (cf. Fig. 

15 13) and then a northern blotting was carried out (cf. Fig. 14). It was shown by the 
result thereof that H37 mRNA was low at Gl phase, increased from the late Gl 
phase to S phase, highly maintained during S phase and somewhat decreased in 
G2 phase although it was still kept high. In an experiment where HeLa cells were 
stopped at G2 phase by nocodazole and the cell cycle was synchronously shifted 

20 (cf. Fig. 15), it was also found that H37 mRNA decreased together with shift from 
G2 to Gl and increased again at shift to S phase (cf. Fig. 16). Similarly, 
expression of Cdc6 rose together with shift from Gl phase to S phase but there 
was a difference from H37 in that, as S phase proceeded, it decreased and, at G2 
phase it was suppressed to low. The result shows that the expression of H37 

25 mRNA changes even in the proliferating cell cycle, becomes maximum at S phase 
where it functions and is kept high throughout S phase. 

After that, in order to investigate the changes in H37 protein and huCdc7 
kinase activity depending upon H37 in cell cycle, amount of H37 protein in K562 
30 cells synchronized with nocodazole was analyzed by western analysis while Cdc7 
kinase activity was analyzed by phosphorylation reaction of GST-huMCM2 protein 
using anti-huCdc7 immunoprecipitate in the same manner as in Fig. 15. 
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Although H37 protein was not detected from Gl/M boundary to Gl phase, it 
started to be detected at the Gl/S boundary and detected throughout S phase. 
This was in contrast to the fact that cyclin E protein is detected already in Gl 
phase and, as S phase proceeds, the amount suddenly decreases. On the other 
5 hand, huCdc kinase activity was also detected in S phase only corresponding to 
the level of H37 protein. Further, H37 protein was detected as plural 
phosphorylated bands in S phase and was thought to be phosphorylated by 
huCdc7 that was probably associated. On the contrary, level of huCdc7 protein 
was almost constant throughout cell cycle. This result proves that the level of 

CO 10 huCdc7 protein varies in cell cycle and is kept high in S phase whereby huCdc7 

Pi kinase activity is also kept high in S phase. 

\i In order to further investigate the expression of H37, intracellular 

JU. localization of H37 protein in animal cells was measured. The result of indirect 

%t 15 fluorescent antibody method using two kinds of H37-specific antibodies was that, 
q in both HeLa and WI38 cells, mtrinsic H37 protein was observed as very clear 

H various spots in the nuclei (cf. Fig. 17). 

When those results and the intranuclear localization of huCdc7 catalytic 
20 subunit already reported by the inventors (EMBO J. 16:4340-4351, 1997) were 
taken into consideration, it was confirmed that huCdc7/H37 complex was a 
kinase localized in the nuclei and that its regulatory subunit H37 was expressed 
depending upon the stage of cell cycle. 

25 

Example 7. 

Function of the mtrinsic H37 protein in the shift from Gl to S in cell cycle 
was investigated by antibody microinjection method. Antibody against 
30 N-terminal 305 amino acids of H37 protein (anti-H37N antibody) and antibody 
against C-terminal oligopeptide (anti-H37Cpep antibody) were treated by affinity 
purification and those antibodies were microinjected into normal fibroblast cells 
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(KD cells) derived from human lip. KD cells were previously arrested at GO phase 
by means of serum starvation and then synchronously proceeded in cell cycle by 
re-addition of serum. Numbers of the BrdU-positive cells, which BrdU is a 
nucleotide derivative and is incorporated into the cells, were measured whereby it 
5 was investigated how much fractions of cells at S phase were present after various 
stages after addition of serum. The result is as shown in Fig. 18. It was 
confirmed that the cells started DNA synthesis after about 18 hours from serum 
addition and that, after 24 hours, almost 90% of the cells were in S phase. 

10 Accordingly, in this Example, antibody was microinjected at the period of 

after 12 hours after the addition of serum where the cells were still in Gl phase 
and, at 26 hours when cells would be of in S phase, the cells were fixed and the 
BrdU-positive cells were measured. The result is as shown in Fig. 19. Although 
70% of the cells into which anti-H37N antibody was microinjected did not shift 

15 into S phase, there was nearly no effect by the control antibody. Even by the 
microinjection of the anti-H37Cpep antibody, an effect of inhibiting the shift to S 
phase which was same as or even more than the anti-H37N antibody was 
observed. In addition, when anti-H37Cpep antibody and a peptide which was an 
antigen for the preparation of anti-H37Cpep antibody were simultaneously 

20 microinjected into the cells, 70% or more of the cells were shifted to S phase. 

Fig. 20 is a staining example of BrdU and microinjected antibody. The 
antibody was microinjected at the stage when the cells were from the mid to late S 
phase and, during that period, expression of H37 protein is believed to be low. 
25 The microinjected antibody efficiently bound to the freshly synthesized H37 
protein and, as a result, it is speculated to inhibit the transfer of H37 protein into 
nucleus. Those results strongly suggest the function of H37, i.e. the function of 
huCdc7/H37 complex is required for the progression of S phase of the animal 
cells. 



30 
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Industrial Applicability 

As fully illustrated hereinabove, the present invention provided human 
H37 protein, a regulatory subunit for Cdc7 that regulate replication of human 
5 cells, human gene encoding this protein as well as cDNA thereof, an antibody 
against H37 protein, and a method for controlling the proliferation of human cells 
using those genetic engineering materials and antibody. As a result, it provides 
potentially novel means for preparation of the necessary amount of stem cells or 
the like to be used for the therapy of various human diseases or for the 
1 0 suppression of proliferation of cancer cells. 
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CLAIMS 

I . Human H37 protein having an amino acid sequence of SEQ ID NO: 1 . 

5 2. Human H37 protein having an amino acid sequence where one or more 
amino acid residue(s) in the amino acid sequence of SEQ ID NO: 1 is/ are deleted 
therefrom, substituted therefor or added thereto. 

3. Human H37 protein having an amino acid sequence of SEQ ID NO: 2. 

10 

4. Human H37 protein having an amino acid sequence where one or more 
amino acid residue(s) in the amino acid sequence of SEQ ID NO: 2 is/are deleted 
therefrom, substituted therefor or added thereto. 

15 5. A human gene encoding the human H37 protein of claim 1 or 2. 

6. A cDNA of the human gene of claim 5, which has a base sequence of SEQ 
ID NO: 3 or a base sequence where one or more base(s) in the base sequence of 
SEQ ID NO: 3 is/ are deleted therefrom, substituted therefor or added thereto. 

20 

7. A cDNA of the human gene of claim 5, which has a base sequence of SEQ 
ID NO: 4 or a base sequence where one or more base(s) in the base sequence of 
SEQ ID NO: 4 is/are deleted therefrom, substituted therefor or added thereto. 

25 8. A DNA fragment comprising a partial sequence of cDNA of claim 6. 

9. A DNA fragment comprising a partial sequence of cDNA of claim 7. 

10. A recombinant vector having the cDNA of claim 6. 

30 

II. A recombinant vector having the cDNA of claim 7. 
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12. An antibody against the human H37 protein of claim 1 or 2. 

13. An antibody against the human H37 protein of claim 3 or 4. 

5 14. A method for promoting the proliferation of cells, which comprises 
introducing the cDNA of claim 6 or the DNA fragment of claim 8 together with 
expression-regulatory sequences into the cells. 

15. A method for promoting the proliferation of cells, which comprises 
10 introducing the cDNA of claim 7 or the DNA fragment of claim 9 together with an 

expression regulatory sequence into the cells. 

16. A method for suppressing the proliferation of cells, which comprises 
introducing the antibody of claim 10 into the cells. 

15 

17. A method for suppressing the proliferation of cells, which comprises 
introducing the antibody of claim 1 1 into the cells. 
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ABSTRACT 

This application provides human H37 protein having an amino acid 
sequence of SEQ ID NO: 1 or NO: 2; human gene encoding the protein; cDNA of 
5 the human gene which has a base sequence of SEQ ID NO: 3 or NO: 4; DNA 
fragment comprising a partial sequence of the cDNA; recombinant vector having 
the cDNA; antibody against human H37 protein; and a method for controlling the 
proliferation of cells by introducing the above DNA or antibody into the cells. 
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SEQUENCE LISTING 

<110> Japan Science and Technology Corporation 

<120> Juman H37 protein and cDNA encoding the protein 
. 5 <140> PCT/JP99/06067 

<141> 1999-11-01 

<150> JP No. 10-311408 

<151> 1998-10-30 

<160> 4 
10 <210> 1 

<211> 674 

<212> PRT 

<213> Homo sapiens 

<400> 1 

15 Met Asn Ser Gly Ala Met Arg Me His Ser Lys Gly 
1 5 10 

Gly lie Gin Val Lys Asn Glu Lys Asn Arg Pro Ser 

20 25 
Lys Thr Asp Asn Arg Pro Glu Lys Ser Lys Cys Lys 
20 35 40 

Lys Val Phe Tyr Leu Asp Leu Pro Ser Val Thr lie 
50 55 60 

Gin Lys Asp Me Lys Asp Leu Gly Gly Arg Val Glu 
65 70 75 

25 Lys Asp Me Ser Tyr Leu Me Ser Asn Lys Lys Glu 
85 90 
Gin Thr Leu Gly Arg Me Ser Pro Val Pro Ser Pro 
100 105 



His Phe Gin Gly 
15 

Leu Lys Ser Leu 
30 

Pro Leu Trp Gly 
45 

Ser Glu Lys Leu 

Glu Phe Leu Ser 
80 

Ala Lys Phe Ala 
95 

Glu Ser Ala Tyr 
110 
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Thr Ala Glu Thr Thr Ser Pro His Pro Ser His Asp Gly Ser Ser Phe 

115 120 125 

Lys Ser Pro Asp Thr Val Cys Leu Ser Arg Gly Lys Leu Leu Val Glu 

130 135 140 

Lys Ala lie Lys Asp His Asp Phe Me Pro Ser Asn Ser Me Leu Ser 
145 150 155 160 

Asn Ala Leu Ser Trp Gly Val Lys Me Leu His Me Asp Asp Me Arg 

165 170 175 

Tyr Tyr Me Glu Gin Lys Lys Lys Glu Leu Tyr Leu Leu Lys Lys Ser 

180 185 190 

Ser Thr Ser Val Arg Asp Gly Gly Lys Arg Val Gly Ser Gly Ala Gin 

195 200 205 

Lys Thr Arg Thr Gly Arg Leu Lys Lys Pro Phe Val Lys Val Glu Asp 

210 215 220 

Met Ser Gin Leu Tyr Arg Pro Phe Tyr Leu Gin Leu Thr Asn Met Pro 
225 230 235 240 

Phe Me Asn Tyr Ser Me Gin Lys Pro Cys Ser Pro Phe Asp Val Asp 

245 250 255 

Lys Pro Ser Ser Met Gin Lys Gin Thr Gin Val Lys Leu Arg Me Gin 

260 265 270 

Thr Asp Gly Asp Lys Tyr Gly Gly Thr Ser lie Gin Leu Gin Leu Lys 

275 280 285 

Glu Lys Lys Lys Lys Gly Tyr Cys Glu Cys Cys Leu Gin Lys Tyr Glu 

290 295 300 

Asp Leu Glu Thr His Leu Leu Ser Glu Gin His Arg Asn Phe Ala Gin 
305 310 315 320 

Ser Asn Gin Tyr Gin Val Val Asp Asp Me Val Ser Lys Leu Val Phe 
325 330 335 
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Asp Phe Val Glu Tyr Glu Lys Asp Thr Pro Lys Lys Lys Arg Me Lys 
240 245 350 

Tyr Ser Val Gly Ser Leu Ser Pro Val Ser Ala Ser Val Leu Lys Lys 
355 360 365 

5 Thr Giu Gin Lys Glu Lys Val Glu Leu Gin His lie Ser Gin Lys Asp 
370 375 380 

Cys Gin Glu Asp Asp Thr Thr Val Lys Glu Gin Asn Phe Leu Tyr Lys 
385 390 395 400 

Glu Thr Gin Glu Thr Glu Lys Lys Leu Leu Phe Me Ser Glu Pro Me 
10 405 410 415 

Pro His Pro Ser Asn Glu Leu Arg Gly Leu Asn Glu Lys Met Ser Asn 

420 425 430 

Lys Cys Ser Met Leu Ser Thr Ala Glu Asp Asp Me Arg Gin Asn Phe 
435 440 445 

15 Thr Gin Leu Pro Leu His Lys Asn Lys Gin Glu Cys Me Leu Asp Me 
450 455 460 

Ser Glu His Thr Leu Ser Glu Asn Asp Leu Glu Glu Leu Arg Val Asp 
465 470 475 480 

His Tyr Lys Cys Asn Me Gin Ala Ser Val His Val Ser Asp Phe Ser 
23 485 490 495 

Thr Asp Asn Ser Gly Ser Gin Pro Lys Gin Lys Ser Asp Thr Val Leu 

500 505 510 

Phe Pro Ala Lys Asp Leu Lys Glu Lys Asp Leu His Ser lie Phe Thr 
515 520 525 

25 His Asp Ser Gly Leu Me Thr Me Asn Ser Ser Gin Glu His Leu Thr 
530 535 540 

Val Gin Ala Lys Ala Pro Phe His Thr Pro Pro Glu Glu Pro Asn Glu 
545 550 555 560 
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Cys Asp Phe Lys 

Lys Val Lys lie 
580 

5 Asn Ala Glu Phe 
595 

Arg lie Cys Ser 
610 

Ser Glu Glu Lys 
10 625 

Ser Gly lie Cys 

Asn Leu Leu Thr 
660 

15 Gly Phe 
674 

<210> 2 

<211> 234 

<212> PRT 
20 <213> Homo sapiens 

<400> 2 

Met Asn Ser Gly Ala Met Arg lie His Ser Lys Gly His Phe Gin Gly 

15 10 15 

Gly Me Gin Val Lys Asn Glu Lys Asn Arg Pro Ser Leu Lys Ser Leu 
25 20 25 30 

Lys Thr Asp Asn Arg Pro Glu Lys Ser Lys Cys Lys Pro Leu Trp Gly 

35 40 45 

Lys Val Phe Tyr Leu Asp Leu Pro Ser Val Thr Me Ser Glu Lys Leu 



Asn Met Asp Ser Leu Pro 
565 570 
lie Leu Gly Arg Asn Arg 
585 

Asp Lys Arg Thr Glu Phe 
600 

Ser Pro Val Gin Ser Leu 
615 

Ser Glu Phe Leu Gly Phe 
630 

Asn Val Leu Asp lie Trp 
645 650 
Ala Phe Phe Ser Ser Pro 
665 



Ser Gly Lys lie His Arg 
575 

Lys Glu Asn Leu Glu Pro 
590 

lie Thr Gin Glu Glu Asn 
605 

Leu Asp Leu Phe Gin Thr 
620 

Thr Ser Tyr Thr Glu Lys 
635 640 
Glu Glu Glu Asn Ser Asp 
655 

Ser Thr Ser Thr Phe Thr 
670 
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50 55 60 

Gin Lys Asp Me Lys Asp Leu Gly Gly Arg Va! Glu Glu Phe Leu Ser 
65 70 75 80 

Lys Asp He Ser Tyr Leu He Ser Asn Lys Lys Glu Ala Lys Phe Ala 
5 .85 90 95 

Gin Thr Leu Gly Arg He Ser Pro Val Pro Ser Pro Glu Ser Ala Tyr 

100 105 110 

Thr Ala Glu Thr Thr Ser Pro His Pro Ser His Asp Gly Ser Ser Phe 
115 120 125 

10 Lys Ser Pro Asp Thr Val Cys Leu Ser Arg Gly Lys Leu Leu Val Glu 
130 135 140 

Lys Ala lie Lys Asp His Asp Phe He Pro Ser Asn Ser lie Leu Ser 
145 150 155 160 

Asn Ala Leu Ser Trp Gly Val Lys lie Leu His lie Asp Asp He Arg 
15 165 170 175 

Tyr Tyr He Glu Gin Lys Lys Lys Glu Leu Tyr Leu Leu Lys Lys Ser 

180 185 190 

Ser Thr Ser Val Arg Asp Gly Gly Lys Arg Val Gly Ser Gly Ala Gin 
195 200 205 

20 Lys Thr Arg Thr Gly Arg Leu Lys Lys Pro Phe Val Lys Val Glu Asp 
210 215 220 

Met Ser Gin Ser Pro Ala Val His Leu Met 
225 230 234 

<210> 3 
25 <211> 2780 
<212> DNA 
<213> Homo sapiens 
<400> 3 
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AATTCGGCAC GAGCTCTCTG AGGCTGCGCC AAGACCTGAA GCGGCGGACC GAGAGCCCGG 60 
GTCTGAGACT GAGAGAGCAA CGGAATGGAG GCGGGGTAGA GGCGGAAACA CAACCTGCAG 120 
GGCCAGAGCG AGGCGCGAGA AGGACGGCGG CGTGAGGGGG CGGGGCGCGC AGCGCGAGAA 180 
GGCAGGCACG AGGGGCGAGC GCGAGGCGGG GCACGGCGCG TGGCGTGAGA CGGGGCGGGG 240 

5 CGCGCGTATC GGCGCCGCGG CCGCGTGACG CGTTTTCAAA TCTTCAACCG CCGCAGCCCA 300 
CTCGTTTGTG CTTTGCGCCT TCCTCCTCCG CGCCTTGGAG CCGGATCCGG CCCCGGAAAC 360 
CCGACCTGCA GACGCGGTAC CTCTACTGCG TAGAGGCCGT AGCTGGCGGA AGGAGAGAGG 420 
CGGCCGTCCT GTCAACAGGC CGGGGGAAGC CGTGCTTTCG CGGCTGCCCG GTGCGACACT 480 
TTCTCCGGAC GCAGCATGTA GGTGCCGGGC GACTGCCATG AACTCCGGAG CCATGAGGAT 540 

10 CCACAGTAAA GGACATTTCC AGGGTGGAAT CCAAGTCAAA AATGAAAAAA ACAGACCATC 600 
TCTGAAATCT CTGAAAACTG ATAACAGGCC AGAAAAATCC AAATGTAAGC CACTTTGGGG 660 
AAAAGTATTT TACCTTGACT TACCTTCTGT CACCATATCT GAAAAACTTC AAAAGGACAT 720 
TAAGGATCTG GGAGGGCGAG TTGAAGAATT TCTCAGCAAA GATATCAGTT ATCTTATTTC 780 
AAATAAGAAG GAAGCTAAAT TTGCACAAAC CTTGGGTCGA ATTTCTGCTG TACCAAGTCC 840 

15 AGAATCTGCA TATACTGCAG AAACCACTTC ACCTCATCCC AGCCATGATG GAAGTTCATT 900 
TAAGTCACCA GACACAGTGT GTTTAAGCAG AGGAAAATTA TTAGTTGAAA AAGCTATCAA 960 
GGACCATGAT TTTATTCCTT CAAATAGTAT ATTATCAAAT GCCTTGTCAT GGGGAGTAAA 1020 
AATTCTTCAT ATTGATGACA TTAGATACTA CATTGAACAA AAGAAAAAAG AGTTGTATTT 1080 
ACTCAAGAAA TCAAGTACTT CAGTAAGAGA TGGGGGCAAA AGAGTTGGTA GTGGTGCACA 1140 

20 AAAAACAAGA ACAGGAAGAC TCAAAAAGCC TTTTGTAAAG GTGGAAGATA TGAGCCAACT 1200 
TTATAGGCCA TTTTATCTTC AGCTGACCAA TATGCCTTTT ATAAATTATT CTATTCAGAA 1260 
GCCCTGCAGT CCATTTGATG TAGACAAGCC ATCTAGTATG CAAAAGCAAA CTCAGGTTAA 1320 
ACTAAGAATC CAAACAGATG GCGATAAGTA TGGTGGAACC TCAATTCAAC TCCAGTTGAA 1380 
AGAGAAGAAG AAAAAAGGAT ATTGTGAATG TTGCTTGCAG AAATATGAAG ATCTAGAAAC 1440 

25 TCACCTTCTA AGTGAGCAAC ACAGAAACTT TGCACAGAGT AACCAGTATC AAGTTGTTGA 1500 
TGATATTGTA TCTAAGTTAG TTTTTGACTT TGTGGAATAT GAAAAGGACA CACCTAAAAA 1560 
GAAAAGAATA AAATAGAGTG TTGGATCCCT TTCTCCTGTT TCTGCAAGTG TCCTGAAAAA 1620 
GACTGAACAA AAGGAAAAAG TGGAATTGCA ACATATTTCT CAGAAAGATT GCCAGGAAGA 1680 
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TGATACAACA GTGAAGGAGC AGAATTTCCT GTATAAAGAG ACCCAGGAAA CTGAAAAAAA 1740 
GCTCCTGTTT ATTTCAGAGC CCATCCCCCA CCCTTCAAAT GAATTGAGAG GGCTTAATGA 1800 
GAAAATGAGT AATAAATGTT CCATGTTAAG TACAGCTGAA GATGACATAA GACAGAATTT 1860 
TACACAGCTA CCTCTACATA AAAACAAACA GGAATGCATT CTTGACATTT CCGAACACAC 1920 
5 ATTAAGTGAA AATGACTTAG AAGAACTAAG GGTAGATCAC TATAAATGTA ACATACAGGC 1980 
ATCTGTACAT GTTTCTGATT TCAGTAGAGA TAATAGTGGA TCTCAACCAA AACAGAAGTC 2040 
AGATACTGTG CTTTTTCCAG CAAAGGATCT CAAGGAAAAG GACCTTCATT CAATATTTAC 2100 
"5 TCATGATTCT GGTCTGATAA CAATAAACAG TTCACAAGAG CACCTAACTG TTCAGGCAAA 2160 
L GGCTCCATTC CATACTCCTC CTGAGGAACC CAATGAATGT GACTTCAAGA ATATGGATAG 2220 
" 10 TTTACCTTCT GGTAAAATAC ATCGAAAAGT GAAAATAATA TTAGGACGAA ATAGAAAAGA 2280 
ri AAATCTGGAA CCAAATGCTG AATTTGATAA AAGAACTGAA TTTATTACAC AAGAAGAAAA 2340 
CAGAATTTGT AGTTCACCGG TACAGTCTTT ACTAGACTTG TTTCAGACTA GTGAAGAGAA 2400 
-■j ATCAGAATTT TTGGGTTTCA CAAGCTACAC AGAAAAGAGT GGTATATGCA ATGTTTTAGA 2460 
"5 TATTTGGGAA GAGGAAAATT CAGATAATCT GTTAACAGCG TTTTTCTCGT CCCCTTCAAC 2520 
~l 15 TTCTACATTT ACTGGCTTTT AGAATTTAAA AAATGCATAC TTTTCAGAAG TGATAAGGAT 2580 
CATATTCTTG AAATTTTTAT AAATATGTAT GGAAATTCTT AGGATTTTTT TACCAGCTTT 2640 
GTTTACAGAG CCAAATGTAA ATATTAAAAA TAAATATTTG CAATTTTCTA CAGAATTGAA 2700 
TACCTGTTAA AGAAAAATTA CAGAATAAAC TTGTGACTGG TCTTGTTTTA CATTAAAAAA 2760 
AAAAAAAAAA AAAACTCGAG 2780 
2D <210> 4 
<211> 2719 
<212> DNA 
<213> Homo sapiens 
<400> 4 

25 AATTCGGCAC GAGCTCTCTG AGGCTGCGCC AAGACCTGAA GCGGCGGACC GAGAGCCCGG 60 
GTCTGAGACT GAGAGAGCAA CGGAATGGAG GCGGGGTAGA GGCGGAAACA CAACCTGCAG 120 
GGCCAGAGCG AGGCGCGAGA AGGACGGCGG CGTGAGGGGG CGGGGCGCGC AGCGCGAGAA 180 
GGCAGGCACG AGGGGCGAGC GCGAGGCGGG GCACGGCGCG TGGCGTGAGA CGGGGCGGGG 240 
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CGCGCGTATC GGCGCCGCGG CCGCGTGACG CGTTTTCAAA TCTTCAACCG CCGCAGGCCA 300 
CTCGTTTGTG CTTTGCGCCT TCCTCCTCCG CGCCTTGGAG CCGGATCCGG CCCCGGAAAC 360 
CCGACCTGCA GACGCGGTAC CTCTACTGCG TAGAGGCCGT AGCTGGCGGA AGGAGAGAGG 420 
CGGCCGTCCT GTCAACAGGC CGGGGGAAGC CGTGCTTTCG CGGCTGCCCG GTGCGACACT 480 

5 TTCTCCGGAC CCAGCATGTA GGTGCCGGGC GACTGCCATG AACTCCGGAG CCATGAGGAT 540 
CCACAGTAAA GGACATTTCC AGGGTGGAAT CCAAGTCAAA AATGAAAAAA ACAGACCATC 600 
TCTGAAATCT CTGAAAACTG ATAACAGGCC AGAAAAATCC AAATGTAAGC CACTTTGGGG 660 
AAAAGTATTT TACCTTGACT TACCTTCTGT CACCATATCT GAAAAACTTC AAAAGGACAT 720 
TAAGGATCTG GGAGGGCGAG TTGAAGAATT TCTCAGCAAA GATATCAGTT ATCTTATTTC 780 

10 AAATAAGAAG GAAGCTAAAT TTGCACAAAC CTTGGGTCGA ATTTCTCCTG TACCAAGTCC 840 
AGAATCTGCA TATACTGCAG AAACCACTTC ACCTCATCCC AGCCATGATG GAAGTTCATT 900 
TAAGTCACCA GACACAGTGT GTTTAAGCAG AGGAAAATTA TTAGTTGAAA AAGCTATCAA 960 
GGACCATGAT TTTATTCCTT CAAATAGTAT ATTATCAAAT GCCTTGTCAT GGGGAGTAAA 1020 
AATTCTTCAT ATTGATGACA TTAGATACTA CATTGAACAA AAGAAAAAAG AGTTGTATTT 1080 

15 ACTCAAGAAA TCAAGTACTT CAGTAAGAGA TGGGGGCAAA AGAGTTGGTA GTGGTGCACA 1140 
AAAAACAAGA ACAGGAAGAC TCAAAAAGCC TTTTGTAAAG GTGGAAGATA TGAGCCAAAG 1200 
CCCTGCAGTC CATTTGATGT AGACAAGCCA TCTAGTATGC AAAAGCAAAC TCAGGTTAAA 1260 
CTAAGAATCC AAACAGATGG CGATAAGTAT GGTGGAACCT CAATTCAACT CCAGTTGAAA 1320 
GAGAAGAAGA AAAAAGGATA TTGTGAATGT TGCTTGCAGA AATATGAAGA TCTAGAAACT 1380 

20 CACCTTCTAA GTGAGCAACA CAGAAACTTT GCACAGAGTA ACGAGTATCA AGTTGTTGAT 1440 
GATATTGTAT CTAAGTTAGT TTTTGACTTT GTGGAATATG AAAAGGACAC ACCTAAAAAG 1500 
AAAAGAATAA AATACAGTGT TGGATCCCTT TCTCCTGTTT CTGCAAGTGT CCTGAAAAAG 1560 
ACTGAACAAA AGGAAAAAGT GGAATTGCAA CATATTTCTC AGAAAGATTG CCAGGAAGAT 1620 
GATACAACAG TGAAGGAGCA GAATTTCCTG TATAAAGAGA CCCAGGAAAC TGAAAAAAAG 1680 

25 CTCCTGTTTA TTTCAGAGCC CATCCCCCAC CCTTCAAATG AATTGAGAGG GCTTAATGAG 1740 
AAAATGAGTA ATAAATGTTC CATGTTAAGT ACAGCTGAAG ATGACATAAG ACAGAATTTT 1800 
ACACAGCTAC CTCTACATAA AAACAAACAG GAATGCATTC TTGACATTTC CGAACACACA 1860 
TTAAGTGAAA ATGACTTAGA AGAACTAAGG GTAGATCACT ATAAATGTAA CATACAGGCA 1920 
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